Operating System Design Principles for Scalable Shared Memory Multiprocessors
نویسندگان
چکیده
We describe SALSA an operating system that in corporates techniques for achieving scalability in large scale shared memory NUMA multiprocessors We evaluate the e ects of cache organization and caching policy on latency hiding via rapid thread switching With write back set associative caches we demon strate signi cant improvements in program perfor mance with latency hiding when cache miss latency is high even with lowmiss rates SALSA exploits clustering in a NUMA system and provides primitives for hierarchical data placement and thread scheduling We show that proper data placement can double performance in Willow a hierarchical shared memory architecture SALSA also provides user control over memory allocation for ne tuning a pro gram s memory requirements which is shown to im prove program performance by up to
منابع مشابه
Evolution of an Operating System for Large . Scale Shared . Memory Multiprocessors
Scalable shared-memory multiprocessors (those with non-uniform memory access times) are among the most flexible architectures for high-performance parallel computing, admitting efficient implementations of a wide range of process models, communication mechanisms, and granularities of parallelism. Such machines present opportunities for general-purpose parallel computing that cannot be exploited...
متن کاملSystem Software Support for Reducing Memory Latency on Distributed Shared Memory Multiprocessors
This paper overviews results from our recent work on building customized system software support for Distributed Shared Memory Multiprocessors. The mechanisms and policies outlined in this paper are connected with a single conceptual thread: they all attempt to reduce the memory latency of parallel programs by optimizing critical system services, while hiding the complex architectural details o...
متن کاملUPMLIB: A Runtime System for Tuning the Memory Performance of OpenMP Programs on Scalable Shared-Memory Multiprocessors
We present the design and implementation of UPMLIB, a runtime system that provides transparent facilities for dynamically tuning the memory performance of OpenMP programs on scalable shared-memory multiprocessors with hardware cache-coherence. UPMLIB integrates information from the compiler and the operating system, to implement algorithms that perform accurate and timely page migrations. The a...
متن کاملVirtual Clusters: Resource Mangement on Large Shared-memory Multiprocessors a Dissertation Submitted to the Department of Computer Science and the Committee on Graduate Studies of Stanford University in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy
Despite the fact that large scale shared-memory multiprocessors have been commercially available for several years, system software that fully utilizes all of their features is still not available. These machines require system software that is scalable, supports fault containment, and provides scalable resource management. Software supporting these features is currently unavailable, mostly due...
متن کاملAutomatic Computation and Data Partitioning on Scalable
Scalable shared memory multiprocessors are becoming increasingly popular platforms for high-performance scienti c computing because they both scale to large numbers of processors and support the familiar shared memory abstraction. In order to improve application performance on these machines, it is essential to divide computation among processors and to place data carefully in the distributed s...
متن کامل